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Abstract — Representation of a swarm of independent robotic 
agents under graph-theoretic constructs allows for more formal 
analysis of convergence properties. We consider the local and 
global convergence behavior of an N -member swarm of agents 
in a modified consensus problem wherein the connectivity of 
agents is governed by probabilistic functions. The addition 
of a random walk control ensures Lyapunov stability of the 
swarm consensus. Simulation results are given and planned 
experiments are described. 

I. Introduction 

Control of a swarm of robots may be achieved in many 
different ways. The seminal work of Reynolds (1987) [1] 
gave the first algorithmically efficient representation of a 
flock with very simple rules. Reynolds' work was ground- 
breaking, allowing simulation of the group dynamics in- 
volving very large numbers of members. Interestingly, the 
fundamental work of Braitenberg [2] in the area of behavioral 
robotics predates Reynolds' research, though that work was 
focused mostly on individual robots or robot pairs. This 
idea was further developed by additional research in swarm 
robotics by Mataric [3] and Parker [4]. Early work was quite 
ad-hoc, but behavioral control continues to find an increas- 
ingly rigorous mathematical framework. For variations on 
this topic, see, e.g., [5]-[14]. 

Swarming laws may be applied a variety of cooperative- 
robotics settings. Each of these settings, though, has a 
primary goal of achieving agreement or consensus. The 
specific meaning of agreement in each scenario is different, 
but in general the implication is that the states of each robot 
evolve in a coordinated way to achieve some goal. A few 
specific types of agreement are creation and maintenance 
of formation [7], [13]— [15], robot rendezvous [16], plume 
localization [17], and robot segregation [18]. 

Formation control may be accomplished under several 
control regimes. Clearly, a centralized controller could drive 
robots to a desired configuration under straightforward tra- 
jectory control. It is much more difficult and interesting to 
develop such capability under a distributed control paradigm, 
allowing greater (if not unlimited) scalability of the swarm. 
Distributed control of formations falls into the control 
regimes of potential functions, geometric control, and Graph- 
based control. Several additional examples of distributed 
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formation can be found in the works of Yun [19], Balch 
[7], Yamaguchi [20], and Antonelli [21]. 

Potential field functions are primarily used to balance com- 
peting objectives. Generally constructed as inverse-square 
functions of distance (inspired by physical models of charged 
particles), potential field functions yield a natural method 
of balancing the relative importance of such control goals 
as obstacle avoidance, goal-seeking, and robot following 
(or avoidance). The use of potential field functions for 
construction and maintenance of formations has been studied 
and reported by, among others, Lewis [22], Leonard [23], 
Olfati-Saber and Murray [12], [24], [25], Bruemmer [26], 
and Mai [27]. 

II. Graph Representation 

In recent years, several researchers have explored the 
use of graph-theoretic concepts to provide a more abstract 
distributed control. Early works in graph-theoretic formation 
control by Olfati-Saber and Murray [28] and Tanner, Jad- 
babaie, and Kumar [13], [14] showed promise. 

A brief discussion of graphs is in order. Graph notation 
varies in the literature; the following notation will be used 
in this paper. An undirected graph Q = G(V, E) of order N 
consists of a set of N nodes or vertices Vg = {1, 2, N} 
and a set of connections or edges Eg C {Vg x Vg}. If a 
pair of nodes is in the set of edges, € Eg, the nodes 

are adjacent and are called neighbors, indicated by i <~ j 
or ij. The set of neighbors of a node i is given by M% — 
{j £ Vg \ j ~ i}. Note that in this case, loop edges, e.g., 
E Eg, are allowed. 

When it is clear from the context, the subscript will be 
dropped, e.g., V = Vg, E = Eg, and Q(V, E) = Q{V g ,E g ). 

A path from node i to j is an ordered set of nodes, starting 
at i and ending at j, such that each consecutive pair is in E. 
If a path exists from every pair of nodes (i, j) in the graph, 
the graph is said to be connected. The diameter of the graph 
is found by finding the minimum path length for each node 
to every other node; the maximum of these minimum length 
paths is the graph diameter. 

The degree of a node is the cardinality of Mi, denoted 
by Aj = \Ni\. The vector of degrees of the nodes of Q is 
denoted Ag = [Ai, A,, A N ] T . If A, = N - 1 V i e 
Vg, i.e., every node is connected to every other node except 
itself, then graph Q is called complete. A complete graph of 
order N is also denoted by K N . 

The complement of graph Q(V,E) is denoted Q, and is 
defined on the same node set as Q but with all disconnected 
nodes in Q are connected, and all connected nodes in Q are 



disconnected. Formally, given a graph Q(V,E), its comple- 
ment is Q = G(V,E),Eg = G {Vg x Vg} \ $ 

Eg,i^j). 

The adjacency matrix is an important data structure in the 
analysis of graphs. The adjacency matrix, for an undirected 
graph, is a representation of the neighboring relationships 
among all agents. The adjacency matrix is defined as shown 
in Eq. ([TJ. 

1^ = 11 . (1) 

I (J otherwise 

The adjacency matrix of undirected graph Q is symmetrix 
and positive semi-definite. Most importantly though, the 
adjacency matrix leads to the graph Laplacian, defined as 
in Eq. 

C(G) = diag(A£) - A. (2) 

Note that the degree vector, AQ, is equivalent to the vector 
of row-sums of A. The eigenvalues of the graph Laplacian 
give information regarding the level of connectivity of the 
graph. 

By its definition, C(Q) will be rank deficient and will 
have at least one zero eigenvalue with eigenvector 1^. 
Beyond this, the total number of zero eigenvalues indicates 
the number of connected components of the graph. For 
example, C(Q) achieves maximum rank of N — 1 when 
the graph is fully connected, thus there is a single zero 
eigenvalue, indicating that the graph is contained in a single 
connected group, i.e., the graph is connected. If there are two 
zero eigenvalues, this indicates that the graph contains two 
connected subgraphs, i.e., one subgraph with N— r connected 
nodes and a second connected subgraph with r connected 
nodes, for any r £ [1,N — 1]. 

The values of the eigenvalues also have significance. If the 
set of eigenvalues, A.£(g) are re-ordered by magnitude, e.g., 
= Ai < A2 < ... < Xn, then the value of A2 indicates the 
algebraic connectivity of Q. As already mentioned, A2 = 
if and only if the full graph is not connected. If A2 7^ 
however, its value is an indicator of the average degree of 
the graph. It is also closely related to the graph diameter Dg ; 
in fact, the lower bound for a connected graph is A/(NxDg) 
[29]. 

Closely related to the above definitions is the weighted 
graph, wherein each edge has an associated weight. Thus, 
the weighted adjacency matrix is defined as in Eq. ([3]): 



otherwise 



(3) 



These weights may be representative of many different phys- 
ical quantities, e.g., they may represent the physical distances 
between the nodes in Euclidean space, Wij — II^Q — ^j'||2? <t 
cost (energy, time, etc.) associated with traversing the edge, 
etc. The weighted graph Laplacian, C{Q W ), is defined in the 
same way, i.e., the degree vector is the vector of row-sums 
of A w . 

Additional information found within the spectra of C(Q), 
A, £(Gw), and A w , are discussed in the 1988 monograph of 



Cvetkovic et al [30], as well as in [3 1]— [37]. These books 
will be used as guides in the development of graph-based 
control in the proposed research. 

Graph-theoretic control of formations is not novel; the 
reported work of Tanner, Jadbabaie, and Kumar [13], [14] 
expounded on this topic. More recently, an extensive set 
of research has been conducted regarding the controllability 
of such swarms under graph-theoretic constructs. Mesbahi 
published several papers [38]-[40] on controllability within 
a swarm for agreement, along with several related works by 
Hatano, Das, Rahmani, Chen, Kim, and Tan relating to the 
study and manipulation of the graph Laplacian and its spectra 
for swarm control [15], [41]-[44]. Of particular focus in the 
proposed research is the extension of such graph-theoretic 
controllability concepts to random graphs along the lines of 
Hatano and Mesbahi [45], [46]. The extension to the dual of 
controllability, observability, has additionally been explored 
by Mesbahi and Zelazo [47], [48]. 

The key element in the works of Mesbahi is the notion of 
the state-dependent dynamic graph. The construct of import 
is a mapping, g s , from the collective system state x = 

[xx,x 2 , ...,x N ] T , ii e 1", le{iixi 2 x...xiiv}c 

R Nn , to the graph Q. 

The state-dynamic graph mapping g s : X — >• Q may be 
defined in many ways, provided g s is a distance function. 
A commonly used and intuitive mapping is the Euclidean 
distance, 

g s = {ij G E I \\xi - £j|| 2 <p, p>0, i^j} (4) 

for a specified p. The 2-norm has an obvious physical 
meaning that is directly applicable to real connectivity prob- 
lems such as wireless networking (where p indicates the 
communication range), or a laser rangefinder (or rangefinder 
pair) with full 2tt angular coverage or an array of vision 
sensors (where p is the detection range for these sensors). 
Alternatively, any p— norm may be used in a similar fashion, 
though with somewhat lower physical meaning. 

One such coordinated goal is aggregation. For example, 
see Equations ([5]) and (|6]l for a swarming law of this type, 
where xi (t) is the n-dimensional state of agent i at time-step 
t, Ui{t) is the self-determined control of agent i at time-step 
t, \&(t) is an objective function possibly independent of the 
swarm interaction, Qi is the subgraph of Q including exactly 
the edges observed by agent i, £(Gi) is the graph Laplacian 



of this subgraph, and b bj , b ag 



are the relative scaling 



gains. This model uses simple, n— dimensional, first-order 
dynamics. Additionally, Xj(t) is the ordered, collective state 
of all agents in Afi(t). 

Xi(t + 1) = Xi (t)+Ui(t) (5) 
Ui(t) = b obj V(t) - b agg C(gi)Xi(t) (6) 

In order for the aggregation to be guaranteed to occur 
for b bj = 0, the initial system graph Q(t = 0) must be 
connected, i.e., 



X(0) G {I 



• iVii 



rank(C(g s (X(0)))) = N-l}. 



Alternatively, if b t,j > and the objective function 4" 
represents a random walk (e.g., a sample from a uniform 
distribution on [—0.5,0.5] taken during time i), the proba- 
bility of successful agreement is 



lim P {agreement) 

t— >oo 



1. 



for finite N and closed space X. Note also, though, that for 
larger values of N, motion of clusters within the graph will 
decrease if the objective function, \P, is truly a random walk. 
Thus modifications to the objective function should be made 
to guarantee continued motion of the swarm. Simple rules 
include behavioral motion such as that proposed by Reynolds 
[1], Mataric [3], or Parker [4]. 

This research work will consider the use of graph-theoretic 
values, such as the degree vector AQ or the eigenvalues 
of the graph Laplacian C(Q) in manipulating the relative 
scaling (i.e. importance) of the swarming and objective laws. 
Before considering this, though, we must first consider the 
characteristics of the consensus of the swarm. 

III. Proof of Consensus Stability 

Let D C M d (d = 2) be compact and convex (and so 
simply connected). For i = 1, . . . N, Xi(t) G D and Xi(0) ~ 
UNIF(D). Let MW — {j G V : \\xi(t) - Xj(t)\\ < R} 
for some R > 0. Here, R is the vision radius. Typically, 
R < diam(D). Note A/i(t) ^ since i G M(*)- 

The dynamics of the particles are given by the agreement 
algorithm plus a noise term 



x l (t + l) = \H l {t)\- 1 



(7) 



Because of the noise term, the particles do not converge to 
consensus. However, once all the particles are close to one 
another, they stay close. 

Proposition 1: Taking the dynamics defined by Eq. [7] 
suppose the to* are i.i.d. random variables drawn from a 
distribution that is absolutely continuous with respected 
to lebesgue measure and supported on some open set U 
containing the origin. If diam(U) < R/2, then 

m.ax\\xi(t)-Xj(t)\\ < R=>max\\Xi(t+l)—Xj(t+l)\\ < R. 

i-j i-j 

Proof: 

max ||x(t+l) — xj(t + 1)| 

i,3 

1 N 1 N 



k=l 

: max||w* - w *| 

%,3 



k=l 



<max(||u;||| + ||w'||) < R 



We now wish to show that eventually the particles must 
be close to one another. 

Lemma 1: Taking the dynamics defined by Eq.|7] suppose 
the w\ are i.i.d. random variables drawn from a distribution 
that is absolutely continuous with respected to lebesgue 



measure and supported on some open set U containing the 
origin, and let E C D be open. Then there is a positive 
constant [i and a deterministic time T such that for all 
i = 1,...,N and t > 

P(xi{t + T) e E) >n. 
Proof: Suppose not. Then there is an open E C D, 
some agent a G V, and some time s > such that 
P(x a (t) e Bo) = for all t > s. Take r > such that 
U contains the ball of radius r centered at the origin. Define 
the sets E k +i — {x £ D : d(x,E k ) < r}, and let 



Ca(t) = \jV Q (t)\- 



/ , - v 3 

jeJVc.(t) 



Xj(t) 



be the center of agent a's communication group at each time 
t. For k > 0, if c a {t) £ E k , then P(x a (t + 1) G E k _ x ) > 
since (c a + U)DE k ^i is non-empty and open and the are 
drawn from a measure absolutely continuous with lebesgue 
measure. So we must have c a (t) E\ for all t > s; i.e., 
P{c a (t) e E\) — for t > s. Now, each E k is open, 
E k C E k+ i and D is compact. Thus, there is a finite positive 
integer F such that D C Ep. By finite recursion, P(c a (i) G 
Ep) = for all t > s, a contradiction. ■ 
Theorem 1: If diam(U) < R/2, there is a random time 
t such that 



P (max \ \xi(s) — Xj(s)\\ < R) 



1 



for all s > t. 

Proof: Take any ball of radius R Br c I?. By the 
lemma, there is a positive constant // and a deterministic 
time T such that P(xi(T) G > ^ for all x 4 (0). Then 
set 

' 1 x 4 (fcT) G B R 
else. 



The li(k) are Bernoulli random variables indexed by i = 
1, . . . , N and k = 1,2, ... . Almost surely, there is fco > 1 
such that 



l[U(ko) = l. 



Thus set t = fcoT an d a Pply Proposition 1. ■ 
A. Discussion 

The theorem takes advantage of the random walk induced 
by the noise. In the appropriate product space, there is a 
unique Lyapunov attractor, and eventually the dynamics will 
find it. Significantly, noisy perturbations considered are quite 
general, so it's possible to apply the theorem if the noise isn't 
symmetric about the origin. 

The convexity of the space D is never used in the proof. It 
is needed so that the averages in the agreement algorithm are 
well-defined. If we're careful about defining line of sight and 
which agents participate in the averaging, the assumption can 
be relaxed to a space that is compact with open interior. This 
allows for spaces with obstacles. If the space is not compact, 
the agents may wander off in different directions. This can 
be addressed by giving the agents a notion of "center" of the 
space so the random walks are recurrent. 



IV. Probabilistic Connectivity 



Experience with real systems quickly reveals that sensing 
and communication ranges are rarely deterministic with 
range. Intuitively, likelihood of successful communication 
or detection is a function of range, but is more accurately 
modeled as a decreasing probability function. 

Intuitively, the resolution of most spatial sensors, e.g. 
cameras or scanning laser rangefinders, is defined in angular 
space. Thus the spatial resolution decreases with distance; 
correspondingly, the minimum detectable feature size in- 
creases with distance. Based on this observation, we posit 
that the detection probability decreases inversely with the 
range. Validation of this assumption in simulation and on 
our experimental testbed is currently underway, and will 
be reported in the sequel. Sensing ranges in simulation are 
commonly represented as a simple circle. Clearly though, 
few sensors have full 360° coverage, thus the orientation of 
the robot (and its on-board sensor) must be considered in the 
sensing function as well. 

The development of network technologies leading to the 
World Wide Web have created a robust system for com- 
munication. When networks are wired, communication can 
generally be assumed to be reliable to within a small amount 
of timeline jitter. However, with mobile roboitcs when the 
communication must be performed wirelessly, many factors 
can contribute to lost communication. The foremost limita- 
tion is range, as this dictates the level of coherent radio- 
frequency power that can be reliably transmitted between 
two nodes. There are many other factors that contribute 
to wireless communication reliability, but our model will 
focus on these. The simplest model of radio-frequency power 
propagation - that of a point source and an outgoing spherical 
wave - yields that the power decreases with the square of 
range. 

With these observations, we generalize the connectivity to 
the mathematically more tractable (with respect to continuity 
of derivatives) decreasing exponential function. The most 
basic probabilistic connectivity function we will consider is 



P((i,j)GE) = e 



= p -dij/p 



where 



dij — \\Xi 



(8) 



(9) 



and p is a shaping constant. Clearly though, this function is 
non-zero for d < oo, thus the asymptotic convergence on a 
non-infinite domain can be easily seen. This is not realistic, 
as there must be a finite cut-off for these connectivity 
functions. Thus we define a maximum range, R, such that 

P((i,j)eE\d ij >R)±0. 

V. Simulation 

Evolution of swarms under the influence of the averaging 
and probabilistic edge-formation described above have been 
investigated in simulation. A practical issue observed during 
these simulations is that the averaging effect of the consensus 
rule results in a decrease in the large-scale random motion. 




Fig. 1. Cluster formation for 20 agents a) at first iteration (uniform 
distribution); b) after 20 iterations; c) after 13131 iterations; d) after 13150 
iterations. 



As all of the agents are subject to random perturbation 
sampled from the same distribution, we should expect that 
the average of this noise results in locally constrained motion. 
However, as seen above, the averaging method described 
above relies achieving full connectivity, which in turn relies 
on the ability of the swarm to continue its random motion. 
Simply increasing the magnitude of the random motion 
does not solve this problem, as the averaging still occurs 
and increasing the magnitude too much may overcome the 
swarm's ability to stay connected. 

In response, a navigation rule is introduced where in the 
magnitude of the random perturbation is a function of the 
individual agent's degree. With this control, individual agents 
in the interior of a cluster exhibit a larger magnitude of 
random motion, such that the cluster averaging will not filter 
the motion. Additionally, given the interior agent's location 
within the cluster, it is less likely that it's motion in a single 
iteration will be of large enough magnitude to break all of 
its connections. Thus greater cluster mobility can be induced 
without increasing the likelihood of separation. Results from 
a simulation of this form are given in Fig. [T] 

VI. Planned Experiments 

For verification of the methods described in this paper, 
an experimental setup with real robots is employed. The 
arena for robot exploration is an area that is roughly 5m x 
3.5m. The agents in this experiment are Create mobile robots 
from iRobot. A motion capture video system comprised of 
twelve Opti Track V100R2 cameras and the Tracking Tools 
API from NaturalPoint is utilized to provide an indoor GPS 
capability. Each Create robot is assigned a unique pattern of 
reflective markers and a corresponding "rigid body" defini- 
tion in the software, allowing reasonably robust tracking of 
the agents at 100Hz. 

These robots are clearly governed by dynamics that are 



more complex than single-integrator particles. Experiments 
will be conducted utilizing real on-board cameras and scan- 
ning laser rangefinders (see [49]) to ascertain the limitations 
of this controller under these dynamics, as well as to un- 
derstand the rate of convergence under these considerations. 
Additionally, the probabilistic functions proposed in this 
paper will be empirically verified. Specifically, the shapes 
of the sensing functions in two (or more) dimensions must 
be addressed. 

VII. Conclusion 

Swarm agreement under certain probabilistic connectivity 
limitations has been proven to be asymptotically convergent 
for bounded spaces. A controller has been developed that 
conforms to the assumptions in the proof, and simulations 
verify the convergence under such a controller. Planned 
experiments will additionally validate these results. 
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